SeNTU: Sentiment Analysis of Tweets by Combining a Rule-based Classifier with Supervised Learning
نویسندگان
چکیده
We describe a Twitter sentiment analysis system developed by combining a rule-based classifier with supervised learning. We submitted our results for the message-level subtask in SemEval 2015 Task 10, and achieved a F1-score of 57.06%. The rule-based classifier is based on rules that are dependent on the occurrences of emoticons and opinion words in tweets. Whereas, the Support Vector Machine (SVM) is trained on semantic, dependency, and sentiment lexicon based features. The tweets are classified as positive, negative or unknown by the rule-based classifier, and as positive, negative or neutral by the SVM. The results we obtained show that rules can help refine the SVM’s predictions.
منابع مشابه
یک چارچوب نیمهنظارتی مبتنی بر لغتنامه وفقی خودساخت جهت تحلیل نظرات فارسی
With the appearance of Web 2.0 and 3.0, users’ contribution to WWW has created a huge amount of valuable expressed opinions. Considering the difficulty or impossibility of manually analyzing such big data, sentiment analysis, as a branch of natural language processing, has been highly considered. Despite the other (popular) languages, a limited number of research studies have been conducted in ...
متن کاملImproved Optimized Sentiment Classification On Dynamic Tweets
Real time Sentiment analysis is a subfield of Natural Language Processing concerned with the determination of opinion and subjectivity in a text, which has many applications. In this paper, classifiers for sentiment analysis of user opinion towards through comments and tweets sing Support Vector Machine (SVM) is described. The goal is to develop a classifier that performs sentiment analysis, by...
متن کاملOn Classifying the Political Sentiment of Tweets
For this project, we attempted to classify the political sentiment of tweets containing the case-insensitive string ‘Obama’ in an effort to automatically gauge the public opinion of US President Barack Obama. To accomplish this goal we investigated rule-based, supervised, and semi-supervised learning methods. Our main approach involved bootstrapping an ngram-feature-based maximum entropy classi...
متن کاملAnnotate-Sample-Average (ASA): A New Distant Supervision Approach for Twitter Sentiment Analysis
The classification of tweets into polarity classes is a popular task in sentiment analysis. State-of-the-art solutions to this problem are based on supervised machine learning models trained from manually annotated examples. A drawback of these approaches is the high cost involved in data annotation. Two freely available resources that can be exploited to solve the problem are: 1) large amounts...
متن کاملSentiment Analysis of Political Tweets: Towards an Accurate Classifier
We perform a series of 3-class sentiment classification experiments on a set of 2,624 tweets produced during the run-up to the Irish General Elections in February 2011. Even though tweets that have been labelled as sarcastic have been omitted from this set, it still represents a difficult test set and the highest accuracy we achieve is 61.6% using supervised learning and a feature set consistin...
متن کامل